Defining User Profile to Improve Knowledge Extraction in a Digital Library of Scientific Documents
Authors
Abstract
Annotation is a key way in which documents grow and increase in value. This paper explores the possibility of using concepts extracted from documents with a Natural Language Processing tool to characterize the content of digital theses. Building on the results of that study, it then explores how annotated theses can be used to access pertinent information stored in these documents and to extract knowledge by defining different user profiles.
Similar resources
Investigating the Level of Observing the Evaluation Criteria for User Interface in library services providing to the blind and deaf users in the word
Purpose: Digital library user interfaces have a determining role in the desirable performance of this kind of library. Digital library service providers for blind and deaf users will perform best when those users can interact properly with them. This study aims to evaluate and analyze the criteria related to user interface in digital libraries servi...
Scientific Data and Document Processing in ChemxSeer
ChemXSeer is a digital library and a data repository for the chemistry domain. The data deposited into our repository is linked with digital documents to create aggregates of resources representing the links between the data and the articles in which the data is reported. ChemXSeer enables the user to annotate the data using a metadata capturing tool. The metadata is indexed and searched to ret...
Iran National Digital Library of Medicine (INMDL): Dos and Don'ts
Iran National Digital Library of Medicine was launched in 2008 by Shahid Beheshti University of Medical Sciences in order to supply English language scientific resources for the Universities of Medical Sciences throughout the country. The Library could be accessed via www.inlm.org. Given the academic definition for national and digital libraries, it seems that the services and resources offered...
A New Domain Independent Keyphrase Extraction System
In this paper we present a keyphrase extraction system that can extract potential phrases from a single document in an unsupervised, domain-independent way. We extract word n-grams from the input document. We incorporate linguistic knowledge (i.e., part-of-speech tags) and statistical information (i.e., frequency, position, lifespan) of each n-gram in defining candidate phrases and their respectiv...
Unsupervised and domain-independent extraction of technical terms from scientific articles in digital libraries
A central issue for making the contents of documents in a digital library accessible to the user is the identification and extraction of technical terms. We propose a method to solve this task in an unsupervised, domain-independent way: We use a nominal group chunker to extract term candidates and select the technical terms from these candidates based on string frequencies retrieved using the M...